Recent advances in hypernasal speech detection using the nonlinear teager energy operator
نویسندگان
چکیده
Speakers with a defective velopharyngeal mechanism produce speech with inappropriate nasal resonance. It is of clinical interest to detect hypernasality as it is indicative of an anatomical, neurological, or peripheral nervous system problem. While clinical techniques exist for detecting hypernasality, a preferred approach would be noninvasive to maximize patient comfort and naturalness of speaking. In this study, a noninvasive technique based on the Teager Energy operator is proposed. Employing a proposed model for normal and nasalized speech, a signi cant di erence between the Teager Energy pro le for lowpass and bandpass ltered nasalized speech is shown, which is nonexistent for normal speech. An optimum classi cation algorithm is formulated that detects the presence of hypernasality using a measure of the di erence in the Teager Energy pro les. The classi cation algorithm was evaluated using native English speakers producing front and mid vowels. Results show that the presence of hypernasality in speech can be reliably detected (94.7%) using the proposed classi cation algorithm.
منابع مشابه
Analysis of Hypernasal Speech in Children with Cleft Lip and Palate
In children with cleft lip and palate speech disorders appear often. One major disorder amongst them is hypernasality. This is the first study which shows that it is possible to automatically detect hypernasality in connected speech without any invasive means. Therefore, we investigated MFCCs and pronunciation features. The pronunciation features are computed from phoneme confusion probabilitie...
متن کاملFrequency band analysis for stress detection using a teager energy operator based feature
Studies have shown that the performance of speech recognition algorithms severely degrade due to the presence of task and emotional induced stress in adverse conditions. This paper addresses the problem of detecting the presence of stress in speech by analyzing nonlinear feature characteristics in specific frequency bands. The framework of the previously derived Teager Energy Operator(TEO) base...
متن کاملNonlinear Speech Features for the Objective Detection of Discontinuities in Concatenative Speech Synthesis
An objective distance measure which is able to predict audible discontinuities in concatenative speech synthesis systems is very important. Previous results showed that linear approaches are not very effective to detect audible discontinuities. The best result was obtained by using the Kullback-Leibler distance on power spectra with the rate of 37%. In this paper, we present two nonlinear appro...
متن کاملSpeech Signal Processing: Non-Linear Energy Operator Centric Review
In modern days the need of speech signal processing using energy operators has drawn a lot of attention for researchers. The use of Teager Energy Operator and other differential energy operators in signal processing enables us a lot of new techniques to estimate and process the speech signal in noisy environment. In this paper we shall discuss the speech signal processing using the nonlinear en...
متن کاملNonlinear Analysis and Classiication of Speech under Stressed Conditions
The speech production system is capable of conveying an abundance of information with regards to sentence text, speaker identity, prosodics, as well as emotion and speaker stress. In an eeort to better understand the mechanism of human voice communication, researchers have attempted to determine reliable acoustic indicators of stress using such speech production features as fundamental frequenc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996